# High-Precision Quantization
Qwen3 8B GGUF
Apache-2.0
Qwen3 is the latest generation of large language models in the Tongyi Qianwen series, offering a complete suite of dense models and Mixture of Experts (MoE) models. Based on large-scale training, Qwen3 has achieved breakthrough progress in reasoning capabilities, instruction following, agent functionalities, and multilingual support.
Large Language Model English
Q
prithivMLmods
1,222
1
Qwen3 1.7B GGUF
Apache-2.0
Qwen3 is the latest version of the Tongyi Qianwen series of large language models, offering a range of dense and mixture of experts (MoE) models. Based on large-scale training, Qwen3 has achieved breakthrough progress in reasoning, instruction following, agent capabilities, and multilingual support.
Large Language Model English
Q
prithivMLmods
357
1
Qwen Qwen2.5 VL 72B Instruct GGUF
Other
A quantized version of the Qwen2.5-VL-72B-Instruct multimodal large language model, supporting image-text-to-text tasks, suitable for various quantization levels from high precision to low memory requirements.
Text-to-Image English
Q
bartowski
1,336
1
Qwen Qwen2.5 VL 7B Instruct GGUF
Apache-2.0
A quantized version of Qwen2.5-VL-7B-Instruct, using llama.cpp for quantization, supporting multimodal tasks such as image-to-text conversion.
Text-to-Image English
Q
bartowski
2,056
2
Nvidia OpenCodeReasoning Nemotron 32B IOI GGUF
Apache-2.0
This is the quantized version of the NVIDIA OpenCodeReasoning-Nemotron-32B-IOI model, processed using llama.cpp for quantization, suitable for code reasoning tasks.
Large Language Model Supports Multiple Languages
N
bartowski
1,272
2
Nomic Ai Nomic Embed Code GGUF
Apache-2.0
This is the quantized version of the nomic-ai/nomic-embed-code model, using llama.cpp for imatrix quantization, suitable for code embedding and feature extraction tasks.
Text Embedding
N
bartowski
2,109
3
Nomic Embed Code GGUF
Apache-2.0
The Nomic code embedding model is a top-tier code retrieval tool that supports multiple programming languages and excels in code retrieval tasks.
Text Embedding
N
nomic-ai
1,300
4
Gemma 3 27b Tools Q5 K M GGUF
This model is a GGUF format version converted from Gemma-3-27b-tools, suitable for local inference tasks.
Large Language Model
G
attashe
101
1
Mlabonne Gemma 3 4b It Abliterated GGUF
This is a quantized version based on the mlabonne/gemma-3-4b-it-abliterated model, using llama.cpp for imatrix quantization, suitable for image-text-to-text tasks.
Image-to-Text
M
bartowski
9,164
8
Mlabonne Gemma 3 27b It Abliterated GGUF
A quantized version based on Google Gemma 3B model, optimized using llama.cpp, supporting multiple quantization levels, suitable for text generation tasks.
Large Language Model
M
bartowski
7,217
20
Gemma 3 12b It GGUF
Gemma-3-12b-it is a large language model developed by Google, based on the transformer architecture, focusing on text generation tasks.
Large Language Model
G
second-state
583
1
Open R1 OlympicCoder 32B GGUF
Apache-2.0
Quantized version of OlympicCoder-32B, based on llama.cpp's imatrix quantization method, suitable for code generation tasks.
Large Language Model English
O
bartowski
12.60k
12
Qwq 32B Preview IdeaWhiz V1 GGUF
Apache-2.0
A 32B-parameter large language model based on llama.cpp, specializing in text generation tasks for chemistry, biology, climate, and medical fields
Large Language Model English
Q
bartowski
847
3
QVQ 72B Preview AWQ
Other
QVQ-72B-Preview is an experimental research model developed by the Qwen team, focusing on enhancing visual reasoning capabilities. This repository provides its AWQ 4-bit quantized version.
Image-to-Text
Transformers English

Q
kosbu
532
8
Qwen2 VL 2B Instruct GGUF
Apache-2.0
Qwen2-VL-2B-Instruct is a multimodal vision-language model that supports image-text generation tasks, based on the Qwen2 architecture with a parameter scale of 2B.
Image-to-Text English
Q
second-state
125
3
Flan T5 Large Grammar Synthesis Gguf
Apache-2.0
A GGUF-format T5 model for grammar and spelling correction, supporting high-precision quantization versions to ensure correction quality.
Large Language Model English
F
pszemraj
137
2
OPEN SOLAR KO 10.7B GGUF
Apache-2.0
This is a GGUF-format quantized version of the beomi/OPEN-SOLAR-KO-10.7B model, supporting 2-8 bit quantization levels, suitable for Korean and English text generation tasks.
Large Language Model Supports Multiple Languages
O
MaziyarPanahi
86
1
Featured Recommended AI Models